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; ABSTRACT 

^: 

■ We present the first spectroscopic measurement of the spatial cross-correlation 
function between damped Lyman a systems (DLAs) and Lyman break galaxies 
(LBGs). We obtained deep m'BVRI images of nine QSO fields with 11 known 
z ~ 3 DLAs and spectroscopically confirmed 211 R < 25.5 photometrically se- 
lected z > 2 LBGs. We find strong evidence for an overdensity of LBGs near 
DLAs versus random, the results of which are similar to that of LBGs near other 
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LBGs. A maximum likelihood cross-correlation analysis found the best fit cor- 
relation length value of vq = 2.9~^l'^h~^Mpc using a fixed value of 7 = 1.6. The 
implications of the DLA-LBG clustering amplitude on the average dark matter 
halo mass of DLAs are discussed. 

Subject headings: galaxies: high redshift — quasars: absorption lines — galaxies: 
formation — galaxies: evolution 

1. Introduction 

Over the last two decades, quasar (QSO) absorption line systems have provided tremen- 
dous insight into the nature of proto- galaxies at high redshift. Of particular interest arc the 
damped Lyman a systems (DLAs) defined to have N(Hl) > 2 x 10^° atoms cm~^ (Wolfe et 
al. 1986, 2005). Systems with such large column densities can provide self-shielding from the 
ambient ionizing radiation at high redshift and protect large reservoirs of neutral gas. Such 
systems are prime sites for star formation. Moderate and high-resolution databases (e.g. 
Prochaska et al. 2001, 2003) and numerous spectroscopic studies of DLAs have given us a 
detailed view of the chemical abundances and gas kinematics of proto-galaxies, yet, except 
for two confirmed z > 2 detections of DLA emission (M0ller et al. 2002, 2004), information 
regarding their mass, luminosity, and morphology has remained elusive. 

One approach to measure the average mass of a galaxy population at high redshift is 
under the implicit assumption that galaxies are a result of the gravitational instabilities of 
primordial density fluctuations. CDM hierarchical models predict that the most massive 
galaxies at high redshift preferentially form clustered together near the density peaks of 
regions with an underlying mass over density, whereas low- mass galaxies form more uniformly 
throughout space (Kaiser 1984; Bardccn et al. 1986). The factor in which the underlying 
dark matter is enhanced in the regions where galaxies cluster as compared to that implied 
by the galaxies themselves is referred to as bias. In this context, the spatial distribution of 
a population of galaxies provides a means to infer their typical dark matter halo mass. This 
method has been used to infer the average mass of Lyman break galaxies (LBGs) at z ~ 3 
(e.g. Steidel et al. 1998; Adelberger et al. 1998) and agrees with the mass estimates from 
nebular line-width measurements (Pettini et al. 2001) and those implied by star formation 
model fits (Shapley et al. 2001). 

It is difficult to measure the mass of DLAs by their spatial clustering because of the 
sparse distribution of bright QSO sightlines and since only approximately one quarter of 
all ^ > 3 QSOs exhibit DLA absorption. However, the mass of DLAs can be inferred by 
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their cross-correlation with another known population (Gawiser et al. 2001). Since the LBG 

auto-correlation function and LBG galaxy bias at z ~ 3 has been established (Adelberger 
et al. 2003; hereafter A03), it is natural to use these galaxies as tracers of the underlying 
mass distribution to cross-correlate with 2; ~ 3 DLAs. With this in mind, A03 used the 
spectroscopic sample of Steidel et al. (2003) to test the spatial distribution of LBGs and 
DLAs. A count of the number of LBGs in three-dimensional cells centered on the four 
DLAs in their sample showed no significant overdensity. In contrast, Bouche Sz Lowenthal 
(2004) used photometric redshifts to measure the clustering of LBGs near two DLAs and 
one sub-DLA in wide-field images. Prom Monte Carlo simulations, they found a non-zero 
DLA-LBG clustering amphtude to greater than 2a and angular analysis on scales out to 
~ 20/i~^Mpc estimated the DLA-LBG cross-correlation to be equal to, or greater than, the 
LBG auto-correlation. Both analyses had limited statistics and could neither confirm, nor 
rule out, a significant overdensity of LBGs near DLAs. 

In this Letter, we highlight the results of a spectroscopic survey for LBGs associated 
with 11 DLAs at 2; ~ 3. We introduce strong evidence for an overdensity of LBGs near DLAs 
and present the first detection of the three-dimensional DLA-LBG cross-correlation function. 
A more complete discussion of the DLA-LBG cross-correlation analysis is presented in Cooke 
et al. (2005b) along with our independent measurement of the z ^ 3 LBG auto-correlation 
function. In this Letter, we adopt = 0.3, Q.a ~ 0.7 cosmology. 

2. Observations 

We acquired deep m'BVRI images from 2000 April through 2003 November of nine QSO 
fields with 11 known DLAs (2.78 < z < 3.32) using the Carnegie Observatories Spectrograph 
and Multi-Object Imaging Camera (Kells et al. 1998) on the 200" Hale telescope at Palomar 
and the Low Resolution Imager and Spectrometer (LRIS; Oke et al. 1995) on the Keck I 
telescope. The data were reduced in a standard manner. We developed a u'BVRl photomet- 
ric selection technique for LBGs at 2; ~ 3 that proved comparable to previous techniques in 
both efficiency and resulting redshift distribution. Over the 465 arcmin^ surveyed, we found 
796 R < 25.5 objects that met our color criteria. Follow-up multi-object spectroscopy of 
529 LBG candidates using LRIS yielded 339 redshifts. We identified 211 LBGs with z >2 
and used these in the cross-correlation analysis. Details of data acquisition, reduction, and 
analysis can be found in Cooke et al. (2005a). 
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3. Clustering Analysis 

3.1. Evidence of an LBG overdensity near DLAs 

As a coarse measure of the distribution of LBGs near DLAs, we divided our survey 
volume into cells with dimensions of the field area of LRIS at 2; ~ 3 (~ 7 x 10/i~^Mpc) and 
Az = 0.025 (~ 17/i-^Mpc). The choice in cell size follows that of Adelberger et al. (1998) 
and includes the majority of the objects associated with a central object having a galaxy bias 
less than or equal to the LBG bias at 2; ~ 3. The extended length in the redshift direction 
is intended to account for the ~ 1 — 2/i~^Mpc error in the systemic redshift measurement 
inherent to LBGs. 

This simple counts-in-cells analysis found an average of 1.27 objects residing in cells 
centered on each of the 11 DLAs where an average of 0.85 objects should have been found 
randomly. Random values were determined for objects in identical cells at the redshifts of 
the DLAs pulled from normalized random catalogs that mimicked the constraints of the data 
and were corrected by the photometric selection function [see Cooke et al. (2005a,b)]. This 
observed overdensity can be compared to an average of 1.16 objects found in cells of identical 
size centered on LBGs in our survey having similar redshifts to the DLAs but located in other 
fields. Interestingly, two of the 14 objects associated with the DLAs are QSOs. Since QSOs 
are believed to form in massive dark matter halos that seed supermassive black holes, this 
suggests that the corresponding DLAs reside in overdense regions. 



3.2. DLA-LBG cross-correlation function 

We measured the DLA-LBG cross-correlation function ^dla-lbg using the usual ap- 
proach of comparing galaxy pair separations in the data to galaxy pair separations in the 
random galaxy catalogs. We used the estimator of Landy & Szalay (1993) to measure the 
excess probability over random of finding an LBG at a distance r from a DLA 

^ / N DolaDlUG — Ddi \Biu(; — -RdLaDhjC + ri DLAri LUG x 

iDLA-LBG (r) = (1) 

J^DLAJ^LBG 

where DdlaDlbg is the catalog of data-data pair separations, DdlaRlbg and RdlaDlbg 
are the data-random and random-data pair separation cross-reference catalogs, and RdlaRlbg 
is the catalog of random-random pair separations. This estimator is well-suited for small 
galaxy samples and has a nearly Poisson variance. The random catalogs were constructed to 
be many times larger than the data catalog to reduce shot noise and were then normalized 
to the data. The mean LBG density was determined from the data in all 11 fields. We 
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determined ^ (r) by counting the number of pairs in each catalog over a series of log or linear 
intervals (i.e. bins). In addition, we made the assumption that ^(r) follows a power law of 
the form 



We initially measured the cross-correlation function by duplicating the cylindrical bin- 
ning technique described in AOS, Appendix C. This technique was adopted to help minimize 
the effect that LBG redshift uncertainties have on the clustering signal as compared to tra- 
ditional radial bins. In addition, this approach permitted a direct comparison of our results 
to the published LBG auto-correlation values of A03 using the available onhne dataset^ of 
Steidel et al. (2003) since both surveys were executed in a similar manner and used the same 
instruments and configurations. 

In this treatment, the expected projected angular overdensity is defined to be 



where Tz is the greater of 1000 km sec~^(l + z)/H{z) and 7rg, and B and Ix are the beta 
and incomplete beta functions with x = r1{r1 + r^)^^ (Press ct al. 1992). Applying this 
method to the DLA-LBG cross-correlation, we found best fit parameter values and 1 sigma 
uncertainties of Tq = 3.3 ± 1.3/i~"'^Mpc, 7 = 1.7 ± 0.4. Figure 1 presents and compares these 
results with the LBG auto-correlation results of A03 and is plotted in a consistent manner 
where Vmax = as described above. The errors on the cross-correlation values shown in 
the figure are those determined using the formulation in Landy & Szalay (1993) and the 
reported errors on the functional fit were determined by duplicating the Monte Carlo error 
analysis as described in A03. The latter error analysis may underestimate the true error by 
a factor of ~ 1 - 2 (Adelberger et al. 2005). 

Although the uncertainties are large, it is immediately apparent from Figure 1 that 
the form and central values of the two correlation functions are similar. In addition, we 
computed a cross-correlation length of tq = 3.5 ± 1.0/i~^Mpc for a fixed value of 7 = 1.6, 
equivalent to the value reported in A03 and Adelberger et al. (2005) for the LBG auto- 
correlation. Our decision to center the DLAs in the observed fields prevented an estimation 
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3.3. Conventional binning 
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^http://vizier.cfa.harvard.edu/viz-bin/VizieR?-source=J/Ap J/592/728/ 
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of the DLA-LBG cross-correlation effectively beyond ~ Ah'^Mpc using the above method. 
However, our cross-correlation values are consistent with the constraints placed on the DLA- 
LBG cross-correlation by Bouche & Lowenthal (2004) using a comparable analysis over a 
range of ~ 5 — 15/i~^Mpc. 

3.4. Maximum likelihood 

As an independent method of analysis, and to make the most of our dataset, we deter- 
mined the maximum likelihood of a power law fit (equation 2) to the observed data (e.g. 
Croft et al. 1997; Mullis et al. 2004). We divided the radial separations into a large number 
of finely spaced regular intervals that coincided with either 1 or LBGs. Poisson statistics 
hold in the regime of large interval number and small probability per interval. We used this 
to form the likelihood function 

l^l- ... I^J- 

where is the expected number of pairs in the ith interval, is the observed number of 
pairs for that same interval, and the index j runs over the elements where there are no 
pairs. The expected number of pairs was determined by solving equation 1 for DdlaDlbg 
over a reasonable range of Tq and 7. We then maximized the expression S — — 21n>C. 
Confidence levels were defined as /S.S — S{ro^f,est, Ihest) ~ S{ro, 7) with the assumption that 
S has a distribution. We found the best fit values and 68% confidence levels for the 
cross-correlation using this method to be tq = 2.8!^2 o^^^^P*^) 7 — 2.ll} 4 and a best fit 
value of To = 2.9l};5/i~^Mpc for a fixed 7 = 1.6. Figure 2 displays these results. 

Cooke et al. (2005b) describe the above analyses in more detail, present several tests to 
address the shortcomings of each method, and make efforts to quantify the physical effects 
that the multi-object slitmasks have on the clustering signal. A short summary of best fit 
values and la uncertainties described here and from that work is listed in Table 1. It can 
be seen that all independent methods, and tests thereof, result in consistent central values 
within their uncertainties. 

4. Discussion 

The LBG bias at 2; ~ 3, derived from the LBG auto-correlation of the R < 25.5 spectro- 
scopic sample, indicates an average LBG dark matter halo mass of ~ IO^^Mq (e.g. Steidel 
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et al. 1998; Adelberger et al. 1998). The 11 DLAs presented here comprise an unbiased 
representation of a random cross-section weighted sample. The similarity between the LBG 
auto-correlation and the DLA-LBG cross-correlation implies that the bias factors of the two 
populations are comparable and that DLAs, on average, may form in similarly massive po- 
tential wells. We consider the implications from the best fit central value tq ~ 2.9 using the 
maximum likelihood method with fixed 7 = 1.6. This measurement corresponds to a DLA 
galaxy bias of b^LA ~ 2.4 and represents an average DLA halo mass of {Mola) ~ IO^-'^-^Mq. 
This value is higher than what is predicted for DLAs by simple models (e.g. Mo et al. 
1998) but is in very good agreement with numerical models of varying resolution that invoke 
strong galactic-scale winds (Nagamine et al. 2005) and thermal feedback (Mailer et al. 2001; 
Bouche et al. 2005). In these models, galactic outflows purge low-mass halos of gas capable 
of generating damped Lyman a absorption lines and result in a higher mean mass. It must 
be noted that because CDM predicts a steeply rising mass function, it is expected that the 
median DLA halo mass is lower than the mean mass implied here. 

The observed similarity in the spatial distributions of DLAs and LBGs also helps support 
previous ideas that the two populations may be connected (Schaye 2001; M0ller et al. 2002). 
Recent analysis of the Hubble Ultra Deep Field (Chen & Wolfe 2005) shows little evidence 
for in situ star formation throughout the DLA neutral gas. However, calculations using the 
Cll* absorption feature to trace the cooling rates in DLAs (Wolfe et al. 2003) require local 
sources of radiation to heat DLAs. A plausible scenario is that LBGs are embedded in the 
same systems that contain DLAs. This model is reinforced by the near equality between 
DLA cooling rates and LBG heating rates. Furthermore, the lack of detected DLA emission 
out to reasonable impact parameters from background QSOs and the above implied average 
halo mass are consistent with DLAs sampling the bulk of the LBG population with typical 
impact parameters of < 1" and median luminosities of R > 27. 

Although the uncertainties in this initial measure of the three-dimensional DLA-LBG 
cross-correlation function are large, there remains strong evidence for an overdensity of LBGs 
near DLAs. The similarity in the central values between the DLA-LBG cross-correlation 
and the LBG auto-correlation underscore a need for additional observations to improve the 
statistical significance of these results and to provide a step toward a complete picture of 
galaxy formation and evolution. 

We would like to thank K. L. Adelberger and C. Mullis for helpful and informative 
discussions. This work was partially supported by the National Science Foundation grant 
AST-0307824 and the NSF Astronomy & Astrophysics Postdoctoral Fellowship (AAPF) 
grant AST-0201667 awarded to Eric Gawiser. 
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Fig. 1. — Measurement of the DLA-LBG cross-correlation following the binning method of 
Adelberger et al. (2003) and plotted in a consistent manner. The cross-correlation values 
are indicated by red diamonds and we find a best fit of vq = 3.3 ± 1.3/i^^Mpc, 7 = 1.7 ± 0.4 
indicated by the solid red line. The errors shown are near Poisson and the reported errors 
are where 68% of the best fit values lie from a Monte Carlo analysis of the functional fit. For 
a fixed value of 7 = 1.6, we find a best fit correlation length of tq = 3.5 ± 1.0/i~^Mpc. The 
LBG auto-correlation (blue squares) of Adelberger et al. (2003) are overlaid over a similar 
scale with the published fit of tq = 3.96 ± 0.29/;.~^Mpc, 7 = 1.55 ± 0.15 (blue dotted line). 
The DLA-LBG cross-correlation values are consistent with the angular wide-field analysis of 
Bouche & Lowenthal (2004) significant on scales of ~ 5 — 10h~^Mpc. 
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Fig. 2. — Two parameter probability contours for the DLA-LBG cross-correlation using the 
maximum likelihood method. The best fit values of vq = 2.8]^2 g^~^Mpc, 7 = 2.lt\i are 
indicated by the large red diamond. The best fit value of tq = 2.9~^\'t^h~^M'pc for a fixed 
value of 7 = 1.6 (small black diamond) is shown with the associated la uncertainty on tq. 
For comparison, the blue square and error bars indicate the LBG auto-correlation best fit 
values and la uncertainties of 4.0 ±0.6/i~^Mpc, 7 = 1.6 ±0.1 from Adelberger et al. (2005). 
Here, the angular positions of the galaxies in the random catalogs were made to be identical 
to the data to minimize possible artificial clustering effects caused by the physical constraints 
of the slitmasks. 
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Table 1. DLA-LBG Cross-Correlation Parameter Summary 



Method 


To 


7 


Conventional binning*^ ^ 


3.32 ± 1.3 


1.74 ±0.4 


Maximum likelihood''^ 


2.811^:^ 


2.1ltii 


Cumulative test'' ^ 


3.84l^J 


2.061?:^ 


Conventional binning^ ^ 


3.21 ± 1.0 


2.03 ±0.2 


Maximum likelihood'^ ^ 


3.20l|2 


1.621}:^ 


Cumulative test ^'^^ 


3.9llt^ 





^Galaxy separations binned using the cylindrical 
approach described in Adelberger et al. (2003), Ap- 
pendix C 

''Angular positions of galaxies in the random cat- 
alogs are identical to the angular positions of the 
data (to minimize possible artificial clustering ef- 
fects caused by the slitmasks) 

'^Galaxy separations binned radially 

'^Described in Cooke et al. (2005b) 

*^ Angular positions of galaxies in the random cat- 
alogs are random 



